OpenCL™ API 1.0 Quick Reference Card 


The OpenCL Platform Layer 

The OpenCL platform layer which implements platform-specific features that allow applications to query OpenCL devices, device 
configuration information, and to create OpenCL contexts using one or more devices. 


OpenCL (Open Computing Language) is a multi¬ 
vendor open standard for general-purpose parallel 
programming of heterogeneous systems that 
include CPUs, GPUs and other processors. OpenCL 
provides a uniform programming environment for 
software developers to write efficient, portable code 
for high-performance compute servers, desktop 
computer systems and handheld devices. 

[n.n.n] refers to the section in the API Specification 
available at www.khronos.org/opencl. 


The OpenCL Runtime 

Command Queues [ 5 . 1 ] 
cl_command_queue cICreateCommandQueue ( 

cl_context context, cl_device_id device, 
cl_command_queue_properties properties, 
cljnt *errcode_ret) 

properties: CL_QUEUE_PROFILING_ENABLE, 
CL_QUEUE_OUT_OF_ORDER_EXEC_MODE_ ENABLE 

cljnt cIRetainCommandQueue (cl_command_queue 
command_queue) 

cljnt dReleaseCommandQueue (cl_command_queue 
command_queue) 

cljnt dGetCommandQueuelnfo ( 

cl_command_queue command_queue, 
cl_command_queueJnfo param_name, 
sizej param_value_size, void *param_value, 
sizej *param_value_size_ret) 
paramname: CL_QUEUE_CONTEXT, 

CL_QUEUE_DEVICE, 

CL_QUEUE_REFERENCE_COUNT, 

CL_QUEUE_PROPERTIES 

cljnt cISetCommandQueueProperty (d_command_queue 
command_queue, cl_command_queue_properties 
properties, cl_bool enable, 
cl_command_queue_properties *old_properties) 
properties: 

CL_QUEUE_OUT_OF_ORDER_EXEC_MODE_ ENABLE, 
CL_QUEUE_PROFILING_ENABLE 


Memory Objects 

Memory objects include buffer objects, and image objects. Refer 
to the Graphic page for information about image objects. 

A buffer object stores a one-dimensional collection of elements. 
Elements of a buffer object can be a scalar data type (such an 
int, float), vector data type, or a user-defined structure, and are 
stored in sequential fashion and can be accessed using a pointer 
by a kernel executing on a device. The data is stored in the same 
format as it is accessed by the kernel. 

Create Buffer Objects [5.2.1] 
cl_mem cICreateBuffer (cljontext context, 
cl_mem Jlags flags, sizej: size, void *host_ptr, 
cljnt *errcode_ret) 
flags: CL_MEM_READ_WRITE, 

CL_MEM_WRITE_ONLY, 

CL_MEM_READ_ONLY, 

CL_MEM_USE_HOST_PTR, 

CL_M EM_ALLOC_HOST_PTR, 

CL_MEM_COPY_HOST_PTR 


Program Objects 

Create Program Objects [5.4.1] 

d_program dCreateProgramWithSource ( 

cljontext context, cl_uint count, const char **strings, 
const sizej * lengths, cljnt *errcode_ret) 

cl_program cICreateProgramWithBinary ( 

cl jontext context, cl_uint num_devices, 
const cl_device_id * deviceJist, const sizej *lengths, 
const unsigned char **binaries, cljnt * binary_status, 
cljnt *errcode_ret) 

cljnt cIRetainProgram (cl_program program) 
cljnt dReleaseProgram (cl_program program) 

Build Program Executable [5.4.21 
cljnt cIBuildProgram (cjprogram program, 

cl_uint num_devices, const cl_device_id *device_list, 
const char *options, void (*pfn_notify) 

(djrogram, void *user_data), void *user_data ) 


Contexts [4.3] 

cljontext dCreateContext ( 

cl jontext_properties * properties, cl jiint num_devices, 
const cl_device_id *devices, void ( *pfn_notify) 

(const char *errinfo, const void *private_info, sizej cb, 
void *user_data), void *user_data, cljnt *errcode_ret) 
cl_context_properties: CL_CONTEXTPLATFORM, 
CL_GL_CONTEXT_KHR, CL_CGL_SHAREGROUP_KHR, 
CL_EGL_DISPLAY_KHR, CL_GLX_DISPLAY_KHR, 
CL_WGL_HDC_KHR 

cl_context dCreateContextFromType ( 

cl_context_properties * properties, 
cljlevicejype device_type, void (* pfn_notify) 

(const char *errinfo, const void *private_info, 
sizej cb, void *user_data), void *user_data, 
cljnt *errcode_ret) 

cl_context_properties: (same as for cl_create_context) 
cljnt dRetainContext (cljontext context) 
cljnt dReleaseContext (cl jontext context) 
cljnt cIGetContextlnfo (cljontext context, 

cljontextjnfo param_name, sizej param_value_size, 
void *param_value, sizej *param_value_size_ret) 
param_name: CL_CONTEXT_REFERENCE_COUNT, 
CL_CONTEXT_DEVICES, CL_CONTEXT_PROPERTIES 

Querying Platform Info and Devices [41,4.2] 
cljnt dGetPlatformlDs (cl_uint num_entries, 

cljlatformjd *platforms, cl_uint *num_platforms) 
cljnt dGetPlatformlnfo (cljlatformjd platform, 

cl_platform_info param_name, sizej param_value_size, 
void *param_value, sizej *param_value_size_ret) 
param_name: CLPLATFORMJ’ROFILE, 
CL_PLATFORM_VERSION, CL_PLATFORM_NAME, 
CL_PLATFORM_VENDOR, CL_PLATFORMJEXTENSIONS 

cljnt dGetDevicelDs (cljlatformjd platform, 
cl jleviceJype device_type, cl_uint num_entries, 
cl_devicejd * devices, cl_uint *num_devices) 
device_type: CL_DEVICE_TYPE_CPU, CL_DEVICE_TYPE_GPU, 
CL_DEVICE_TYPE_ACCELERATOR, CL_DEVICE_TYPE_DEFAULT, 
CL_DEVICE_TYPE_ALL 


Read, Write, Copy Buffer Objects [5.2.2 -5.2.31 
cljnt dEnqueueReadBuffer ( 

djommand_queue command_queue, cl_mem buffer, 
cl_bool blocking_read, sizej offset, sizej cb, 
void *ptr, cl_uint num_eventsjn_wait_list, 
const cl jvent *event_wait_list, cljvent * event) 

cljnt dEnqueueWriteBuffer ( 

cl jommand_queue command_queue, dmem buffer, 
cl_bool blocking_write, sizej offset, sizej cb, 
const void *ptr, cl_uint num_eventsjn_wait_list, 
const cljvent *event_wait_list, cljvent *event) 

cljnt dEnqueueCopyBuffer ( 

cl jommand_queue command_queue, 

cl_mem src_buffer, cl_mem dst_buffer, sizej src_offset, 

sizej dst_offset, sizej cb, 

cljiint num_events_in_waitjist, 

const cljvent *event_wait_list, cljvent * event) 

cljnt cIRetainMemObject (cl jnem memobj) 
cljnt cIReleaseMemObject (cljnem memobj) 


Build Options [5.4.3] 

Preprocessor options. 

(-D options processed in order listed in cIBuildProgram) 
-D name, 

-D name=definition, 

-I dir 

Math Intrinsics options: 

-cl-single-precision-constant, 

-cl-denorms-are-zero, 

Optimization options: 

-cl-opt-disable, -cl-strict-aliasing, 

-cl-mad-enable, -cl-no-signed-zeros, 

-cl-finite-math-only, -cl-fast-relaxed-math, 

-cl-unsafe-math-optimizations 

Warning request/suppress options: 

-w, -Werror 

Unload the OpenCL Compiler [5.4.4] 
cljnt clUnloadCompiler (void) 


cljnt cIGetDevicelnfo ( 

cl_device_id device, cl_device_info param_name, 
sizej param_value_size, void *param_vaiue, 
sizej *param_value_size_ret) 
param_name: CL_DEVICE_PLATFORM, CL_DEVICE_TYPE, 
CL_DEVICE_VENDOR_ID, 
CL_DEVICE_MAX_COMPUTE_UNITS, 
CL_DEVICE_MAX_WORK_ITEM_DIMENSIONS, 

C L_D E VIC E_M AX_WO R K_ITE M_S IZES, 

CL_DEVICE_MAX_WORK_GROUP_SIZE, 

CL_DEVICE_PREFERRED_VECTOR_WIDTH_CHAR, 

CL_DEVICE_PREFERRED_VECTOR_WIDTH_SHORT, 

CL_DEVICE_PREFERRED_VECTOR_WIDTH_INT, 

CL DEVICE_PREFERRED_VECTOR_WIDTHJONG, 

CL_DEVICE_PREFERRED_VECTOR_WIDTH_FLOAT, 

CL_DEVICE_PREFERRED_VECTOR_WIDTH_DOUBLE, 

CL_DEVICE_MAX_CLOCK_FREQUENCY, 

CL_DEVICE_ADDRESS_BITS, 

CL_DEVICE_MAX_MEM_ALLOC_SIZE, 

CL_DEVICE_IMAGE_SUPPORT, 

CL_DEVICE_MAX_READ_IMAGE ARGS, 

cl_device_max_write_image_args, 

CL_DEVICE_IMAGE2D_MAX (WIDTH | HEIGHT), 
CL_DEVICE_IMAGE3D_MAXJWIDTH | HEIGHT | DEPTH), 
CL_DEVICE_MAX_SAMPLERS, 
CL_DEVICE_MAX_PARAMETER_SIZE, 
CL_DEVICE_MEM_BASE_ADDR_AUGN, 

' CL_DEVICE_MIN_DATA_TYPE_ALIGN_SIZE, 
CL_DEVICE_SINGLE_FP_CONFIG, 
CL_DEVICE_GLOBAL_MEM_CACHE_TYPE, 

CL_D EVICE_G LOBAL_M EM JACH ELI NE_SIZE, 

CL_DEVICE_GLOBAL_MEM CACHE_SIZE, 

CL_DEVICE_GLOBAL_MEM_SIZE, 

CL_DEVICE_MAX_CONSTANT_BUFFER_SIZE, 

CL_DEVICE_MAX_CONSTANT_ARGS, 

CL_DEVICE_LOCAL_MEM_TYPE, 

CL_DEVICE_LOCAL_MEM_SIZE, 

CL_DEVICE_ERROR JORRECTIONSUPPORT, 
CL_DEVICE_PROFILING_TIMER_RESOLUTION, 
CL_DEVICE_ENDIAN_LITTLE, 

CL_DEVICE_AVAILABLE, CL_DEVICE_COMPILER_AVAILABLE, 

CL_DEVICE_EXECUTION_CAPABILITIES, 

CL_DEVICE_QUEUE_PROPERTIES, 

CL_DEVICE_NAME, CL_DEVICE_VENDOR, 

CL_DRIVER_VERSION, CL DEVICE_PROFILE, 

CL_DEVICE_VERSION, CLJEVICE_EXTENSIONS 


Map and Unmap Memory Objects [5.2.8] 
void * clEnqueueMapBuffer ( 

cl jommand_queue command_queue, cl_mem buffer, 
cl_bool blocking_map, cl jnap Jlags map Jlags, 
sizej offset, sizej cb, cl_uint num_events_in_wait_list, 
const cljvent *event_wait_!ist, cljvent * event, 
cljnt *errcode_ret) 

cljnt clEnqueueUnmapMemObject ( 

djommand_queue command_queue, cl_mem memobj, 
void *mapped_ptr, cl jint num_events_in_wait_list, 
const cljvent *event_wait_list, cljvent *event) 

Query Buffer Object [5.2.9] 

cljnt cIGetMemObjectlnfo (dmem memobj, 

cl_memJnfo param_name, sizej param_value_size, 
void *param_value, sizej *param_value_size_ret) 
param_name: CL_MEM_TYPE, 

CL_MEM_FLAGS, CL_MEM_SIZE, 

CL_MEM_HOST_PTR, CL_MEM_MAP_COUNT, 

CL_MEM_REFERENCE_COUNT, CLJV1EM JONTEXT 


Query Program Objects [5.4.5] 
cljnt dGetProgramlnfo (cl jrogram program, 
cljrogramjnfo param_name, 
sizej param_value_size, void *param_value, 
sizej *param_value_size_ret) 
param name: CL_PROGRAM_REFERENCE_COUNT, 
CL_PROGRAM_CONTEXT, 
CL_PROGRAM_NUM_DEVICES, 
CL_PROGRAM_DEVICES, 

CL_PROGRAM_SOURCE, 

CL_PROGRAM_BINARY_SIZES, 

CL_PROGRAM_BINARIES, 

CL_PROGRAM_SOURCE 

cljnt dGetProgramBuildlnfo (cl jrogram program, 
cl_device_id device, 
cl_program_buildJnfo param_name, 
sizej param_value_size, 

void *param_value, sizej *param_value_size_ret) 
param name: CL_PROGRAM_BUILD_STATUS, 
CL_PROGRAM_BUILD_OPTIONS, 
CL_PROGRAM_BUILDJOG 
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Kernel and Event Objects 

Create Kernel Queries [5.5.1] 
d_kernel dCreateKernel (cl program program, 
const char *kernel_name, cl_int *errcode_ret) 

cljnt cICreateKernelsInProgram (cl_program program, 

| cl_uint num_kernels, cl_kernel *kernels, 
cl_uint *num_kernels_ret) 

cljnt dRetainKernel (d_kernel kernel) 
cljnt cIReleaseKernel (cl kernel kernel) 

Kernel Arguments & Object Queries [5.5.2,5.5.3] 
cljnt cISetKernelArg (cl_kernel kernel, cl_uint arg index, 
size_t arg_size, const void *arg_value) 

cljnt dGetKernellnfo (cl_kernel kernel, 

cl_kernel_info param_name, size_t param_value_size, 
void *param_value, size_t *param_value_size_ret) 
param_name: CL_KERNEL_FUNCTION_NAME, 
CL_KERNEL_NUM_ARGS, 

CL_KERNEL_REFERENCE_COUNT, 

CL_KERNEL_CONTEXT, CL_KERNEL_PROGRAM 

cl_int dGetKernelWorkGroupInfo (cl kernel kernel, 
cl_device_id device, 

cl_kernel_work_groupjnfo param_name, 
size_t param_value_size, 

void *param_value, size_t *param_value_size_ret) 
param_name: CL_KERNEL_WORK_GROUP_SIZE, 
CL_KERNEL_COMPILE_WORK_GROUP_SIZE, 
CL_KERNEL_LOCAL_MEM_SIZE 


Execute Kernels [5.6] 
cljnt dEnqueueNDRangeKernel ( 

d_command_queue command_queue, cl_kernel kernel, 

cl_uint work_dim, const size_t *global_work_offset, 

const size t *global_work_size, 

const size t *local_work_size, 

cl_uint num_events_in_waitjist, 

const cl_event *event_wait_list, devent *event) 

cljnt dEnqueueTask ( 

cl_command_queue commandqueue, cl_kernel kernel, 

cl_uint num_eventsjn_wait_list, 

const cl_event *event_wait_list, cl_event * event) 

cljnt dEnqueueNativeKernel (cl_command_queue 
command_queue, void (*user Junc)(v oid *), 
void *args, size_t cb_args, cl_uint num_mem_objects, 
const cl_mem *mem_list, const void **args_mem_loc, 
cl_ui nt num_ events_in_ waitjist, 
const d_event *event_wait_list, cl event *event) 

Event Objects [5.7] 
cljnt clWaitForEvents ( 

cl_uint num_events, const cl event *event_list ) 

cljnt dGetEventlnfo ( 

cl event event, cl_event_info param_name, 
size t param_value_size, void *param_value, 
size_t *param_value_size_ret) 

cljnt dRetainEvent (clevent event) 
cljnt dReleaseEvent (clevent event) 


Out-of-order Execution of Kernels 
& Memory Object Commands [5.8] 
cljnt clEnqueueMarker ( 

cl_command_queue command_queue, cl_event *event) 

cljnt clEnqueueWaitForEvents ( 

d_command_queue command_queue, 
cl_uint num_events, const cl_event *event_list) 

cljnt dEnqueueBarrier ( 

cl_command_queue command_queue) 

Profile Operations on Memory Objects 
& Kernels [5.9] 

cljnt cIGetEventProfilinglnfo (cl_event event, 
cl_profilingJnfo param_name, 
size_t param_value_size, void *param_value, 
sizej *param_value_size_ret) 
param_name: CL_PROFILING_COMMAND_QUEUED, 
CL_PROFILING_COMMAND_SUBMIT, 
CL_PROFIUNG_COMMAND_START, 
CL_PROFILING_COMMAND_END 

Flush and Finish [5.10] 
cljnt dFlush (cl_command_queue command_queue) 
cljnt dFinish (cl_command_queue command_queue) 
param_name: CL_EVENT_COMMAND_QUEUE, 
CL_EVENT_COMMAND_TYPE, 
CL_EVENT_COMMAND_EXECUTION_STATUS, 
CL_EVENT_REFERENCE_COUNT 


Supported Data Types 


Built-in Vector Data Types [6.1.2] 


Reserved Data Types [6.1.4] 



Vector Component Addressing [6.1.7] 

The components of a vector may be addressed as shown below or as shown in the table of equivalencies. 

Vector Components 


Vector Addressing Equivalencies 



0 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

13 

14 

15 

float2 v; 

v.x, 

v.sO 

v.y, 

v.sl 















float4 v; 

v.x, 

v.sO 

v.y, 

v.sl 

v.z, 

v.s2 

v.w, 

v.s3 













float8 v; 

v.sO 

v.sl 

v.s2 

v.s3 

v.s4 

v.s5 

v.s6 

v.s7 









floatl6 v; 

v.sO 

v.sl 

v.s2 

v.s3 

v.s4 

v.s5 

v.s6 

v.s7 

v.s8 

v.s9 

v.sa, 

v.sA 

v.sb, 

v.sB 

v.sc, 

v.sC 

v.sd, 

v.sD 

v.se, 

v.sE 

v.sf, 

v.sF 



v.lo 

v.hi 

v.odd 

v.even 

float2 

v.x, v.sO 

V -Y< v.sl 

v.y, v.sl 

v.x, v.sO 

float4 

v.sOl, v.xy 

v.s23, v.zw 

v.sl3, v.yw 

v.s02, v.xz 

float8 

v.s0123 

V.S4567 

v.sl357 

V.S0246 

floatlG 

V.S01234567 

v.s89abcdef 

v.sl3579bdf 

v.s02468ace 


When addressing vector components by numeric indices, they must be 
preceded by the letter s or S, e.g.: si. 

Swizzling, duplication, and nesting are allowed, e.g.: v.yx, v.xx, v.lo.x 


Conversions and Type Casting Examples 

[ 6 . 2 ] 

Ta = (T)b; // Scalar to scalar, or scalar to vector 
Ta = convert_T(fa); 

Ta = convert_7"_/?(b); 

Ta = convert_T_sat_/?(b); 

Ta = as_T(b); 


Rounding Modes [6 2.3.2] 

R can be: 

_rte Round to nearest even 
_rtz Round toward zero 
_rtp Round toward positive infinity 
_rtn Round toward negative infinity 


Operators [6.3] 

These operators behave similarly as in C99 except 
that operands may include vector types when 
possible: 

+ - * % / --++== != & 

' A > <>=<=| ! && || 
» « , = op= sizeof 


Address Space Qualifiers [6.5] 

_global, global _local, local 

_constant, constant _private, private 

Function Qualifiers [6.7] 

_kernel, kernel 

_attribute_((vec_type_hint(int)}) 

_attribute_((work_group_size_hint(X, Y, Z))) 

_attribute_((reqd_work_group_size(X, Y, Z))) 


Preprocessor Directives & Macros [6.9] 

#pragma OPENCL FP_CONTRACT on-off-switch 
on-off-switch: ON, OFF, DEFAULT 

Predefined Macro Names 


_FILE_ 

_LINE_ 

_OPENCL_VERSION_ 

_ENDIAN_LITTLE_ 
_ROUNDING_MODE_ 

_kernel_exec(X, typen) 


_IMAGE_SUPPORT_ 

_FAST_RELAXED_MATH_ 


Current source file 
Line number 
Integer version number 
1 if device is little endian 
Current rounding mode 
(default "rte") 

Same as:_kernel_attribute_( 

(work_group_size_hint(X, 1,1))) \ 

_attribute_((vec_type_hint(typen))) 

1 if images are supported, 

1 if -cl-fast-relaxed-math optimization 
option is specified 









































































































OpenCL™ API 1.0 Quick Reference Card 


Work-Item Built-in Functions [6.11.1] 

O is dimension index. 


uintget work dim () 

Num. of dimensions in use 

size t get global size (uint D) 

Num. of global work-items 

sizejget global id (uint D) 

Global work-item ID value 

size t get local size (uint D) 

Num. of local work-items 

size t get Jocaljd (uint D) 

Local work-item ID 

size t get num groups (uint D) 

Num. of work-groups 

size t get group id (uint D) 

Returns the work-group ID 


Floating Point Math Constants [6.11.2] 

MAXFLOAT Value of maximum non-infinite single¬ 
precision floating-point number. 

HUGE_VALF Positive float constant expression. 

HUGE_VALF evaluates to +infinity. Used 
as an error value. 

INFINITY Constant expression of type float 
representing positive or unsigned 
infinity. 

NAN Constant expression of type float 

representing a quiet NaN. 


integer Built-in Functions [6.11.3] 

T is type char, charn, uchar, ucharn, short, shortn, ushort, 
ushortn, int, into, uint, uintn, long, longn, ulong, or ulongn. 
U refers to the unsigned version of I 


Uabs(Tx) 

1*1 

U abs diff (Tx, Ty) 

| x-y | without modulo overflow 

Tadd sat(Tx, Ty) 

x + y and saturates the result 

ThaddfTx, Ty) 

(x + y)» 1 without modulo 
overflow 

T rhadd (Tx,Ty) 

(x+y +1)»1 

T clz (Tx) 

Number of leading 0-bits in x 

T mad hi (T a,Tb,T c) 

mul hi(o, b) + c 

T mad24 (T a,Tb,T c) 

Multiply 24-bit integer values a 
and b and add the 32-bit integer 
result to 32-bit integer c 

T mad sat (T a, Tb,Tc) 

a * b + c and saturates the result 

TmaxfTx, Ty) 

y if x < y, otherwise it returns x 


T min (Tx, Ty) 

y if y < x, otherwise it returns x 

Tmul_hi(Tx, Ty) 

high half of the product of x and y 

T mul24 (Ta, Tb) 

Multiply 24-bit integer values 
oandb 

T rotate (Tv, Ti) 

result[/ndx] = v[/ndx]« i{indx] 

Tsub sat(Tx, Ty) 

x - y and saturates the result 

For upsample, scalar types are pi 

shortn upsample (charn hi, 
ucharn to) 

ushortn upsample (ucharn hi, 
ucharn to) 

intn upsample (shortn hi, 
ushortn to) 

uintn upsample (ushortn hi, 
ushortn to) 

longn upsample (intn hi, 
uintn to) 

ulongn upsample (uintn hi, 
uintn to) 

emitted for the vector types below. 
result[/]= ((short)W[/]« 8) | lo[i] 

result[/]=((ushort)W[/]« 8) | lo[i) 

result[/]=((int)to[/]« 16) | lo[i] 

result[/]=((uint)h/[/]« 16) | lo[i] 

result[/]=((long)/)/[/]« 32) | lo[i] 

resu It [/]=( (ulong)W[/]« 32) | lo[i) 


Common Built-in Functions [6.11.4] 

T is type float or floatn (or optionally double, doublen, or halfn). 

Optional extensions enable double, doublen, and halfn types. 


T clamp (Tx, T min, T max) 
floatn clamp (floatn x, float min, float max) 
doublen clamp (doublen x, double min, double max) 
halfn clamp (halfn x, half min, half max) 

Clamp x to 
range given 
by min, max 

T degrees (T radians) 

radians to 
degrees 

T max (Tx, Ty) 
floatn max (floatn x, float y) 
doublen max (doublen x, double y) 
halfn max (halfn x, half y) 

Max of x 
andy 

T min (Tx, Ty) 
floatn min (floatn x, float y) 
doublen min (doublen x, double y) 
halfn min (halfn x, half y) 

Min of x 
and y 

7mix(Tx, Ty) 
floatn mix (floatn x, float y) 
doublen mix (doublen x, double y) 
halfn mix (halfn x, half y) 

Linear blend 
of x andy 

T radians (T degrees) 

degrees to 
radians 

T step (T edge, T x) 
floatn step (float edge, floatn x) 
doublen step (double edge, doublen x) 
halfn step (half edge, halfn x) 

0.0 if x < 
edge, else 

1.0 

T smoothstep [TedgeO, Tedgel, T x) 
floatn smoothstep (float edgeO, float edgel, floatn x) 
doublen smoothstep (double edgeO, double edgel, 
doublen x) 

halfn smoothstep (half edgeO, half edgel, halfn x) 

Step and 
interpolate 

T sign (Tx) 

Sign ofx 


Math Built-in Functions [6.11.2] 

TexplO(T) HN 

Exponential base 10 


float nan (uintn nancode) 

Quiet NaN 

7" is type float or floatn (or optionally double, doublen, or halfn). 
intn, uintn, and ulongn must be scalar when T is scalar. The 
symbol HN indicates that Half and Native variants are available 
by prepending "half_" or "native_" to the function name, as in 


Texpml(Tx) 

e A x-1.0 


floatn nan (uintn nancode) 
halfn nan (ushortn nancode) 
doublen nan (ulongn nancode ) 
doublen nan (uintn nancode) 



Tfabs(T) 

Absolute value 




Tfdim (Tx, Ty) 

"Positive difference" between x and y 



halt_cos() and native_cos(). Optional extensions enable double, 

T floor (7) 

Round to integer toward - infinity 


T nextafterf Tx,Ty) 

Next representable floating-point 

uuuuicf/, diiu iidii n lype*. 


Tfma(To,Tb,Tc) 

Multiply and add, then round 


value following x in the direction of y 

T acos (T) 

Arc cosine 


T fmax (Tx, Ty) 
halfn fmax (halfn x, half) 

Return y if x<y, 
otherwise it returns x 


Tpow(Tx, Ty) 

Compute x to the power of y (x A y) 

Tacosh (T) 

Inverse hyperbolic cosine 



T pown (Tx, intn y) 

Compute x A y, where y is an integer 

T acospi (I x) 

acos (x) / ti 


Tfmin (Tx, Ty) 

Return y if y<x, 


Tpowr (Tx,Ty) HN 

Compute x A y, where x is >= 0 

Tasin(T) 

Arc sine 


halfn fmin (halfn x, half) 

otherwise it returns x 


Thalf_recip(7x) 

Tnative recip(Tx) 

1/x 

Tasinh(T) 

Inverse hyperbolic sine 


Tfmod (Tx, Ty) 

Modulus. Returns x-y * trunc (x/y) 


fasinpi (Tx) 

asin (x) / 7t 


T fract (Tx, T *iptr) 

Fractional value inx 


Tremainder (Tx, Ty) 

Floating point remainder function 

T atan (Ty over x) 

Arc tangent 


Tfrexp (Tx, intn *exp) 

Extract mantissa and exponent 


TremquofTx, Ty, intn 

Floating point remainder and 

7atan2 (Ty, Tx) 

Arc tangent of y/x 


T hypot (Tx, Ty) 

square root of x A 2+ y A 2 


*quo) 

quotient function 

T atanh (7) 

Hyperbolic arc tangent 


intn ilogb (Tx) 

Return exponent as an integer value 


T rint (T) 

Round integer to nearest even 

Tatanpi (Tx) 

atan (x) /ti 


Tldexp(Tx, intnn) 

x * 2 A n 



integer 



T rootn (Tx, intn y) 

Compute x to the power of 1/y 

Tatan2pi (Tx,Ty) 

atan2 (x, y)/7i 


Tldexp(Tx, int n) 



T cbrt (7) 

cube root 


Tlgamma (Tx) 

T lgamma r (T x, intn *signp) 

Log gamma function 


Tround (Tx) 

Integral value nearest to x rounding 



T rsqrt (T) HN 

Inverse square root 

T ceil (T) 

Round to integer toward + infinity 





7 log (7) HN 

Natural logarithm 


T sin (7) HN 


Tcopysign (Tx, Ty) 

x with sign changed to sign of y 



sine 


7log2 (T) HN 

Base 2 logarithm 


Tsincos (Tx, T *cosval) 

sine and cosine ofx 

Tcos(T) HN 

cosine 



T cosh ( T) 

hyperbolic consine 


TloglO (T) HN 

Base 10 logarithm 


Tsinh(T) 

hyperbolic sine 

Tcospi(Tx) 

cos (n x) 


Tloglp(Tx) 

In (1.0 + x) 


Tsinpi (Tx) 

sin (ti x) 

Thalf_divide (Tx,Ty) 

T native divide (Tx, Ty) 

x/y 


Tlogb(Tx) 

exponent ofx 


Tsqrt(T) HN 

square root 


T mad ( Ta,Tb,Tc) 

Approximates a* b + c 


Ttan (T) HN 

tangent 

Terfc(T) 

Complementary error function 


T modf (Tx, T*iptr) 

Decompose a floating-point number 


Ttanh (T) 

hyperbolic tangent 

T erf (T) 

Calculates error function of T 



Ttanpi (Tx) 

tan (nx) 

T exp (Tx) HN 

Exponential base e 



Ttgamma ( T) 

gamma function 

T exp2 (T) HN 

Exponential base 2 



T trunc (T) 

Round to integer toward zero 


Geometric Built-in Functions [6.11.5] 

Vector types may have 2 or 4 components. Optional extensions 
enable double, doublen, and halfn types. 


float4 cross (float4 pO, float4 pi) 
double4 cross (double4 pO, double4 pi) 
half4 cross (half4 pO, half4 pi) 

Cross product 

float dot (float pO, float pi) 
float dot (floatn pO, floatn pi) 
double dot (double pO, double pi) 
double dot (doublen pO, doublen pi) 
half dot (half pO, half pi) 
half dot (halfn pO, halfn pi) 

Dot product 


float distance (float pO, float pi) 
float distance (floatn pO, floatn pi) 
double distance (double pO, double pi) 
double distance (doublen pO, doublen pi) 
half distance (half pO, half pi) 
half distance (halfn pO, halfn pi) 

Vector distance 

float length (float p) 
float length (floatn p) 
double length (double p) 
double length (doublen p) 
half length (half p) 
half length (halfn p) 

Vector length 


float normalize (float p) 
floatn normalize (floatn p) 
double normalize (double p) 
doublen normalize (doublen p) 
half normalize (halfp) 
halfn normalize (halfn p) 

Normal vector length 1 

float fast_distance (float pO, float pi) 
float fast distance (floatn pO, floatn pi) 

Vector distance 

float fastjength (float p) 
float fastjength (floatn p) 

Vector length 

float fast_normalize (float p) 
floatn fast normalize (floatn p) 

Normal vector 
length 1 




































































































































OpenCL™ API 1.0 Quick Reference Card 


Relational Built-in Functions [e.n.6] 

Tis type float, floatn, char, charn, uchar, ucharn, short, short/i, 
ushort, ushortn, int, intn, uint, uintn, long, longn, ulong, or 
ulongn (and optionally double, doublen). S is type char, charn, 
short, shortn, int, intn, long, or longn. U is type uchar, ucharn, 
ushort, ushortn, uint, uintn, ulong, or ulongn. Optional 
extensions enable double, doublen, and halfn types. 


int isequal (float x, float y) 
intn isequal (floatn x, floatn y) 
int isequal (double x, double y) 
longn isequal (doublen x, doublen y) 
int isequal (halfx, half/) 
shortn isequal (halfn x, halfn y) 

Compare of x==y 

int isnotequal (float x, float y) 
intn isnotequal (floatn x, floatn y) 
int isnotequal (double x, double y) 
longn isnotequal (doublen x, doublen y) 
int isnotequal (half x, half y) 
shortn isnotequal (halfn x, halfn y) 

Compare of x != y 

int isgreater (float x, float y) 
intn isgreater (floatn x, floatn y) 
int isgreater (double x, double y) 
longn isgreater (doublen x, doublen y) 
int isgreater (half x, half y) 
shortn isgreater (halfn x, halfn y) 

Compare of x>y 

int isgreaterequal (float x, float y) 
intn isgreaterequal (floatn x, floatn y) 
int isgreaterequal (double x, double y) 
longn isgreaterequal (doublen x, doublen y) 
int isgreaterequal (half x, half y) 
shortn isgreaterequal (halfn x, halfn y) 

Compare of x >= y 

int isless(floatx,floaty) 
intn isless (floatn x, floatn y) 
int isless (double x, double y) 
longn isless (doublen x, doublen y) 
int isless (half x, half y) 
shortn isless (halfn x, halfn y) 

Compare of x<y 

int islessequal (float x, float y) 
intn islessequal (floatn x, floatn y) 
int islessequal (double x, double y) 
longn islessequal (doublen x, doublen y) 
int islessequal (half x, half y) 
shortn islessequal (halfn x, halfn y) 

Compare of x<=y 

int islessgreater (float x, float y) 
intn islessgreater (floatn x, floatn y) 
int islessgreater (double x, double y) 
longn islessgreater (doublen x, doublen y) 
int islessgreater (half x, half y) 
shortn islessgreater (halfn x, halfn y) 

Compare of 
(x < y) 11 (x > y) 

int isfinite (float) 
intn isfinite (floatn) 
int isfinite (double) 
longn isfinite (doublen) 
int isfinite (half) 
shortn isfinite (halfn) 

Test for finite 
value 


int isinf (float) 
intn isinf (floatn) 
int isinf (double) 
longn isinf (doublen) 
int isinf (half) 
shortn isinf (halfn) 

Test for +ve or 
-ve infinity 

int isnan (float) 
intn isnan (floatn) 
int isnan (double) 
longn isnan (doublen) 
int isnan (half) 
shortn isnan (halfn) 

Test for a NaN 

int isnormal (float) 
intn isnormal (floatn) 
int isnormal (double) 
longn isnormal (doublen) 
int isnormal (half) 
shortn isnormal (halfn) 

Test for a normal 
value 

int isordered (float x, float y) 
intn isordered (floatn x, floatn y) 
int isordered (double x, double y) 
longn isordered (doublen x, doublen y) 
int isordered (half x, half y) 
shortn isordered (halfn x, halfn y) 

Test if arguments 
are ordered 

int isunordered (float x, float y) 
intn isunordered (floatn x, floatn y) 
int isunordered (double x, double y) 
longn isunordered (doublen x, doublen y) 
int isunordered (half x, half y) 
shortn isunordered (halfn x, halfn y) 

Test if arguments 
are unordered 

int signbit (float) 
intn signbit (floatn) 
int signbit (double) 
longn signbit (doublen) 
int signbit (half) 
shortn signbit (halfn) 

Test for sign bit 

int any (S x) 

1 if MSB in any 
component of x is 
set; else 0 

int all (Sx) 

1 if MSB in all 
components of x are 
set; else 0 

T bitselect (T a,Tb,T c) 
halfn bitselect (halfn 0, halfn b, halfn c) 
doublen bitselect (doublen a, 
doublen b, doublen c) 

Each bit of result is 
corresponding bit of 

0 if corresponding bit 
ofcisO 

T select (Ta, Tb, Sc) 

T select (T a, Tb,Uc) 

doublen select (doublen, doublen, 

longn) 

For each component 
of a vector type, 
result!/] = if MSB of 
c[i] is set ? b[i\ : o[/] 
For scalar type, 
result = c ? b : a 


Optional Extension: Atomic Functions [9.5] 

Q is qualifier_global or_local. T is type int or unsigned int 

for 32-bit atomic functions. T is type long or ulong for 64-bit 
atomic functions. 

To use the base or extended atomic functions, include this 
pragma in your application: 

#pragma OPENCL EXTENSION extension-name : enable 

For base atomic functions, extension-name is one of: 
cl_khr_global_int32_base_atomics 
cl_kh r Joca I_int32_base_atomics 
cl_khr_int64_base_atomics 

For extended atomic functions, extension-name is one of: 
cl_khr_globalJnt32_extended_atomics 
cl_khr_local_int32_extended_atomics 
cl_khr_int64_extended_atomics 


Base atomic functions 

T atom add ( QT*p,Tval ) 

Read, add, and store 

Tatom sub (QT*p,Tval) 

Read, sub, and store 

T atom xchg (Q T *p, T val) 

Read, swap, and store 

Tatomjnc (QT*p) 

Read, increment, and store 

T atom dec ( QT*p ) 

Read, decrement, and store 

T atom_cmpxchg (Q T*p, 

T cmp, T val) 

Read and store (*p —cmp) 

? val: *p 


Extended atomic functions 

T atom min (Q T*p, T val) 

Read, store min(*p, val) 

T atom max (Q T *p, T val) 

Read, store max(*p, val) 

T atom and (Q T*p, Tval) 

Read, store (*p & val) 

Tatom or (QT*p,Tval) 

Read, store (*p \ val) 

Tatom xor ( QT*p,Tval ) 

Read, store (*p A val) 


Vector Data Load/Store Built-in Functions [6.11.7] 

Q is an Address Space Qualifier listed in 6.5 unless otherwise noted. 
R defaults to the current rounding mode, or is one of the Rounding 
Modes listed in 6.23.2. Tis type char, uchar, short, ushort, int, uint, 
long, ulong, half, or float (or optionally double). Tn refers to the 
vector form of type T. Optional extension enables the double and 
doublen types. 


Tn vloadn (size_t offset, const Q T *p) 

Read vector data from 
memory 

void vstoren (Tn data, size t offset, 
QT*p) 

Write vector data to memory 
(Q in this function 
cannot be constant) 

float vload_half (size_t offset, 
const Qhalf *p) 

Read a half from memory 

float/? vload_halfn (size_t offset, 
const Q half *p) 

Read multiple halfs from 
memory 

void vstore_half (float data, 
size_t offset, Q half *p) 
void vstore_half_ft (float data, 
size_t offset, Q half *p) 
void vstorejialf (double data, 
size_t offset, Q half *p ) 
void vstorejialfj? (double data, 
size t offset, Q half *p) 

Write a half to memory 

(Q in this function 
cannot be_constant) 

void vstore_halfn (floatn data, 
size_t offset, Q half *p) 
void vstore_halfnJ? (floatn data, 
size_t offset, Q half *p) 
void vstore_halfn (doublen data, 
size_t offset, Q half *p) 
void vstore_halfn_R (doublen data, 
size t offset, Q half *p) 

Write a half vector to 
memory 

(Q in this function 
cannot be _constant) 

floatn vloada_halfn (size_t offset, 
const Q half *p) 

sizeof (floatn) bytes of data 
read from location 
(p +(offset *n)) 

void vstorea_halfn (floatn data, 
size_t offset, Q half *p) 
void vstorea_halfnJ? (floatn data, 
size_t offset, Q half *p) 
void vstorea_halfn (doublen data, 
size_t offset, Q half *p) 
void vstorea_halfnJ? (doublen data, 
size_t offset, Q half *p) 

Write a half vector to 
vector-aligned memory 

(Q in this function 
cannot be _constant) 


Async Copies and Prefetch Built-in Functions [6.11.11] 

Tis type char, charn, uchar, ucharn, short, shortn, ushort, ushortn, 
int, intn, uint, uintn, long, longn, ulong, ulongn, float, floatn, and 
optionally double, doublen. Optional extension enables the double 
and doublen types. 


event_t async_work_group_copy 

(_local T *dst, const global 

T *src, size_t num_elements, 
event_t event) 

event _t async_work_group_copy 

(—global T *dst, const_local 

T *src, size_t num_elements, 
event t event) 

Copies T elements from src to dst 

void wait_group_events (int 
num_events, event_t 
*event list) 

Wait for events that identify 
the async_work_group_copy 
operations to complete. 

void prefetch (const global 

T *p, size_t num_elements) 

Prefetch num_elements * sizeof(7) 
bytes into the global cache. 


Synchronization and Explicit Memory Fence 
Built-in Functions [6.11.9,6.11.10] 

The flags argument specifies the memory address space and 
can be set to a combination of CLK_LOCAL_MEM_FENCE and 
CLK_GLOBAL_MEM_FENCE. 


void barrier ( 

d mem fence flags flags) 

All work-items in a work-group must 
execute this before any can continue 

void memjence ( 

cl mem fence flags flags) 

Orders loads and stores of a work- 
item executing a kernel 

void read_mem_fence ( 

cl mem fence flags flags) 

Orders memory loads 

void write_mem_fence ( 

cl mem fence flags flags) 

Orders memory stores 
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OpenCL™ API 1.0 Quick Reference Card: Graphics 


Following is a quick reference to the subset of the OpenCL API specification that pertains to 
graphics, [n.n.n] refers to the section in the full specification, which is available at 
www.khronos.org/opencl. 


Image Objects 

Create Image Objects [5.2.4] 
cl_mem clCreatelmage2D ( 

cl_context context, cl_mem_flags flags, 
const cl_image_format *image_format, 
size_t image_width, size_t imagejheight, 
size_t image_row_pitch, void *host_ptr, 
cljnt *errcode_ret) 

flags: CLJ/IEMREADWRITE, CL MEM WRITE ONLY, 

CL_MEM_READ_ONLY, CL_MEM_USE_HOST_PTR, 

CL_MEM_ALLOC_HOST_PTR, a_MEM_COPY_HOST_PTR 

cl_mem clCreatelmage3D ( 

cl_context context, cl_mem_flags flags, 
const d_image_format *imageJormat, 
size_t image_width, size_t image_height, 
size_t image_depth, size_t image_row_pitch, 
size_t image_slice_pitch, void *host_ptr, 
cljnt *errcode_ret) 

flags: CL_MEM_READ_WRITE, CL_MEM_WRITE_ONLY, 

CL_MEM_READ_ONLY, CL_MEM_USE_HOST_PTR, 

CL_M EM_ALLOC_HOST_PTR, CL_MEM_COPY_HOST_PTR 

Query List of Supported Image Formats [sas] 

cljnt cIGetSupportedlmageFormats ( 

cl_context context, cl_mem_flags flags, 
d_mem_object_type image_type, 
cl_uint num_entries, 
cljmage Jormat *imageJormats, 
cl_uint *num_image_formats) 
flags: CL_MEM_READ_WRITE, CL_MEM_WRITE_ONLY, 

CL_M E M_READ_ONLY, CL_MEM_USE_HOST_PTR, 

CL_MEM_ALLOC_HOST_PTR, CL_MEM_COPY_HOST_PTR 

Copy Between Image and 

Buffer Objects [5.2.7] 

djnt clEnqueueCopylmageToBuffer ( 

cl_command_queue command_queue, 
cl_mem srcjmage, cl_mem dst_buffer, 
const size J src_or/g/n[3], const sizej region[ 3], 
size_t dst_offset, cl_uint num_events_in_wait_list, 
const cl_event *event_wait_list, cl_event * event ) 

cljnt clEnqueueCopyBufferTolmage ( 

cl_command_queue command_queue, 

cl_mem src_buffer, cl_mem dstjmage, 

sizej src_offset, const sizej dst_origin[ 3], 

const sizej reg/on[3], 

cl_uint num_events_in_waitjist, 

const cl_event *event_wait_list, cl_event * event) 


Map and Unmap Image Objects [5.2.8] 
void * dEnqueueMapImage ( 

cljommand jjueue command_queue, 
cl jnem image, cl_bool blocking_map, 
cl_map_flags mapJlags, const sizej origin) 3], 
const sizej reg/'on[3], sizej *image_row_pitch, 
sizej *image_slice_pitch, 
cl_ui nt num_eventsjn_ waitjist, 
const cl_event *event_wait_list, 
cl_event *event, cljnt *errcode_ret) 

Read, Write, Copy Image Objects [5.2.6] 

cljnt dEnqueueReadlmage ( 

cl_command_queue command_queue, 
cl_mem image, cl_bool blocking_read, 
const sizej orig/n[3], const sizej region) 3], 
sizej row_pitch, sizej slice_pitch, void *ptr, 
cljiint num_eventsjn_wait_list, 
const cl_event *event_waitjist, cl_event *event ) 
cljnt dEnqueueWritelmage ( 

cl_command_queue command_queue, 
cl_mem image, cl_bool blocking_write, 
const sizej orig/n[3], const sizej reg/on[3], 
sizej input_row_pitch, sizej input_slice_pitch, 
const void *ptr, cl_uint num_eventsjn_waitjist, 
const cl_event * eventj/vaitjist, cl_event *event) 
cljnt clEnqueueCopylmage ( 

cl_command_queue command_queue, 

cljmem srcjmage, cl_mem dstjmage, 

const sizej src_origin[ 3], const sizej dst_origin) 3], 

const sizej reg/on[3], 

cluint num_eventsjn_ waitjist, 

const cl_event *event_waitjist, cl_event *event) 

Query Image Objects [5.2.9] 
cljnt cIGetMemObjectlnfo (d_mem memobj, 
cl jnemjnfo param_name, 
sizej param_value_size, void *param_value, 
sizej *param_value_size_ret) 
param_name: CL_MEMJYPE, 

CL_MEM JLAGS, CL_MEM_SIZE, 

CL_MEM_HOSTJTR, CL_MEM_MAP_COUNT, 

CL_MEMJEFERENCE_COUNT, CL_MEM_CONTEXT 

cljnt dGetlmagelnfo (d_mem image, 
cl JmageJnfo param_name, 
size t param_value_size, void *param_value, 
sizej *param_value_size_ret) 
paramname: CLJMAGEJORMAT, 

CLJMAGE JLEMENT_SIZE, CLJMAGE_ROW PITCH, 
CL_IMAGE_SLICE PITCH, CLJMAGEJIEIGHT, 

CL_IMAGE_WIDTH, CLJMAGEDEPTH 


Sampler Objects [5.3] 

I cl_sampler cICreateSampler (cljontext context, 
cl_bool normalized_coords, 
cl_addressing_mode addressingmode, 
c\J\ter_mode filter_mode, cljnt *errcode_ret) 

\ cljnt dRetainSampler (cjsampler sampler) 
cljnt cIReleaseSampler (cl_sampler sampler) 


cljnt cIGetSamplerlnfo (cl_sampler sampler, 
cl_sampler_info param_name, 
sizej param_value_size, void *param_value, 
sizej *param_value_size_ret ) 
param_value: CL_SAMPLER_REFERENCE_COUNT, 
CL_SAM PLE R_CO NTEXT, 
CL_SAMPLER_FILTER_MODE, 
CL_SAMPLER_ADDRESSING_MODE, 
CL_SAMPLER_NORMALIZED_COORDS 


Sampler Declaration Fields [6.11.8.1] 

| The sampler can be passed as an argument to 
I the kernel using dSetKernelArg, or it can be a 
constant variable of type samplerj declared 
in the program source. 

I const samplerj <sampler-name> = 

<normalized-mode> | <address-mode> | 
<filter-mode> 
normalized-mode: 

CLK_NORMALIZED_COORDSJRUE, 
CLK_NORMALIZED_COORDS_FALSE 
| filter-mode: 

CLK_FILTER_NEAREST, CLK_FILTER_LINEAR 
[ address-mode: 

CLK_ADDRESS_REPEAT, 
CLK_ADDRESS_CLAMPJO_EDGE, 
CLK_ADDRESS_CLAMP, CLK_ADDRESS_NONE 


Image Access Qualifiers [6.6] 

Apply to image image2d J and image3d J 
types to declare if the image memory object is 
being read or written by a kernel. 

_read_only, read_only 
_write_only, write_only 


Image Formats [5.2.4.1,9.8] 

Supported image formats: image_channel_order with 
image jhanneljlatajype. 

Built-in support: 


CL_RGBA: CL_HALF_FLOAT, CLJLOAT, CL_UN0RM_INT{8116}, 
CL_SIGNEDJNT{8116132}, CL_UNSIGNEDJNT{8116132} 


CL BGRA: CL UN0RMJNT8 


Optional support: 


CL_R, CL_A: CL_HALFJLOAT, CL_FLOAT, 
CL_UN0RM_INT{8116}, CL_SIGNEDJNT{8116132}, 
CL UNSIGNED INT{8116132}, CL SN0RM INT{8116} 


CL INTENSITY: CL_HALF_FLOAT, CLJLOAT, 
CLJJNORMJNT{8| 16}, CL SNORMJNT{8116} 


CLJUMINANCE: CL_UNORMJNT{8|16}, CL_HALF_FLOAT, 
CLJLOAT, CL SNORMJNT{8116} 


CL RG, CL RA: CL_HALF_FLOAT, CL_FLOAT, 

CL UN0RMJNT{8| 16}, CL_SIGNED_INT{8116132}, 

CL UNSIGNED INT{8116132}, CL SNORM INT{8|16} 


©2009 Khronos Group - Rev. 1109 
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Image Read and Write Built-in Functions [6.11.8,9.8] 

The built-in functions defined in this section can only be used with image memory objects created with 

c!Createlmage2D or clCreatelmage3D. opt = Optional function. 


float4 read_imagef (image2d J image, sampler_t sampler, int2 coord) 
float4 read_imagef (image2d J image, samplerj sampler, float2 coord) 
int4 read_imagei (image2d J image, samplerj sampler, int2 coord) 
int4 read_imagei (image2dj image, samplerj sampler, float2 coord) 
unsigned int4 read_imageui (image2d J image, samplerj sampler, int2 coord) 
unsigned int4 read imageui (image2dj image, samplerj sampler, float2 coord) 

Read an element from a 2D image. 
sampler specifies the addressing and 
filtering mode to use. 

half4 read Jmageh (image2dj image, samplerj sampler, int2 coord ) opt 

half4 read Jmageh (image2d J image, samplerj sampler, float2 coord) opt 

void writejmagef (image2dj image, int2 coord, float4 color ) 
void writejmagei (image2d J image, int2 coord, int4 color) 
void writejmageui (image2dj image, int2 coord, unsigned int4 color) 

Write color value to (x, y) location 
specified by coord in the 2D image 

void writejmageh (image2d J image, int2 coord, half4 color) opt 

float4 readJmagef (image3dJ image, samplerj sampler, int4 coord) 
float4 read_imagef (image3dj image, sampler _t sampler, float4 coord) 
int4 read_imagei (image3d J image, samplerj sampler, int4 coord) 
int4 read_imagei (image3dJ image, samplerj sampler, float4 coord) 
unsigned int4 read_imageui (image3dj image, samplerj sampler, int4 coord) 
unsigned int4 read_imageui (image3d J image, samplerj sampler, float4 coord) 

Read an element from a 3D image. 
sampler specifies the addressing and 
filtering mode to use. 

half4 read_imageh (image3d J image, samplerj sampler, int4 coord) opt 

half4 readjmageh (image3d J image, samplerj sampler, float4 coord) opt 

int get_image_width (image2dj image) 
int getjmage_width (image3d J image) 

2D or 3D image width in pixels 

int getjmagejieight (image2d J image) 
int get_image_height (image3d J image) 

2D or 3D image height in pixels 

int get image depth (image3dj image) 

3D image depth in pixels 

int get_image_channel_data Jype (image2d J image) 
int get image channel data Jype (image3d J image) 

image channel data type 

int getJmagejhanneljjrder (image2d J image) 
int getJmagejhannel order (image3d J image) 

image channel order 

int2 get_image_dim (image2d J image) 

2D image width and height 

int4 get_image_dim (image3d J image ) 

3D image width, height, and depth 

void writejmageh (image3d J image, int4 coord, half4 color) opt 

Writes color value to (x, y, z) location 
specified by coord in the 3D image. 

void writejmagef (image3d J image, int4 coord, float4 color) opt 

void writejmagei (image3d J image, int4 coord, int4 color) opt 

void writejmageui (image3dj image, int4 coord, unsigned int4 color) opt 

Writes color at coord in the 3D image. 

Include this pragma to enable these 
functions: 

#pragma OPENCL EXTENSION 
cl_khr_3djmage_writes: enable 



OpenCL/OpenGL Sharing APIs [Appendix B] 


Creating OpenCL memory objects from OpenGL objects using the functions dCreateFromGLBuffer, 
clCreateFromGLTexture2D, clCreateFromGLTexture3D, or dCreateFromGLRenderbuffer ensures that the 
underlying storage of that OpenGL object will not be deleted while the corresponding OpenCL memory 
[ object still exists. (Items shown in red are optional) 


CL Buffer Objects > GL Buffer Objects [b.i.i] 

| cljnem dCreateFromGLBuffer (cl context context, 
cl_mem_flags flags, GLuint bufobj, 
int *errcode_ret) 

flags: CL_MEM_{READ_ONLY, CL_MEM_WRITE_ONLY, 
CL_MEM_READ_WRITE 

CL Image Objects > GL Textures [B.1.2] 
cl_mem dCreateFromGLTexture2D ( 

cl_context context, cl_mem_flags flags, 

GLenum target, GLint miplevel, GLuint texture, 
int *errcode_ret) 

flags: (Same as for dCreateFromGLBuffer) 
target: GL_TEXTURE_2D, GL_TEXTURE_RECTANGLE, 
GL_TEXTURE_CUBE_MAP_POSITIVE_(X | Y | Z}, 
GL_TEXTU RE_CU BE_M AP_N EGATIVE_{X | Y | Z) 

| cl mem clCreateFromGLTexture3D ( 

cl_context context, cl_mem_flags flags, 

GLenum target, GLint miplevel, GLuint texture, 
int *errcode_ret) 

flags: (Same as for dCreateFromGLBuffer) 
target: GL_TEXTURE_3D 

CL Image Objects > GL Renderbuffers [b.i.3 ] 
[ cl_mem dCreateFromGLRenderbuffer ( 

cl_context context, d_mem_f!ags flags, 

GLuint renderbuffer, int *errcode_ret) 
flags: (Same as for dCreateFromGLBuffer) 

Query Information [b.i.4 ] 
cljnt dGetGLObjectlnfo (cl_mem memobj, 
dj;l_object_type *gl_object_type, 

GLuint *gl_object_name) 
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gl_object_type: CLGLOBJECTBUFFER, 
CL_GL_OBJECT_TEXTURE2D, 

CL_GL_OBJECT TEXTURE_RECTANGLE, 
CL_GL_OBJECT_TEXTURE3D, 
CL_GL_OBJECT_RENDERBUFFER 
cljnt cIGetGLTexturelnfo (cl_mem memobj, 
cl_gl_texture_info param_name, 
size_t param_value_size, void *param_value, 
size t *param_value_size_ret) 
param_ name: CL_GL_TEXTU RE_TARG ET, 

CL_G L_M IP M AP JEVEL 

Share Objects [B.1.5] 

cljnt dEnqueueAcquireGLObjects ( 

cl_command_queue command_queue, 
cl_uint num_objects, const cl_mem *mem_objects, 
cl_uint num_eventsjn_wait_list, 
const cl_event *event_wait_list, cl_event * event) 
cljnt dEnqueueReleaseGLObjects ( 
d_command_queue command_queue, 
cl_uint numobjects, 
const cl_mem *mem_objects, 
cl_uint num_events_in_waitjist, 
const cl_event *event_wait_list, cl_event *event) 

Querying CL Devices in GL Context [9.11] 
cljnt cIGetGLContextlnfoKHR ( 

const ct_context_properties * properties, 
cl^gl_contextJnfo param_name, 
size_t param_value_size, void *param_value, 
size_t *param_value_size_ret) 
param_name: CL_DEVICES_FOR_GL_CONTEXT_KHR, 
CL_CURRENT_0EV1CE_F0R_GL_C0NTEXT_KHR 
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